
    Intrinsic Rewards for Maintenance, Approach, Avoidance and Achievement Goal Types

    In reinforcement learning, reward is used to guide the learning process. The reward is often task-dependent, and designing a good reward function may require significant domain knowledge. This paper proposes general reward functions for maintenance, approach, avoidance, and achievement goal types. These reward functions exploit the inherent property of each goal type and are thus task-independent. We also propose metrics to measure an agent's performance in learning each type of goal. We evaluate the intrinsic reward functions in a framework that can autonomously generate goals and learn solutions to those goals using a standard reinforcement learning algorithm. We show empirically how the proposed reward functions lead to learning in a mobile robot application. Finally, using the proposed reward functions as building blocks, we demonstrate how compound reward functions (reward functions that generate sequences of tasks) can be created, allowing the mobile robot to learn more complex behaviors.
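
    The abstract does not spell out the reward formulations, but the four goal types suggest simple, task-independent shapes. Below is a minimal Python sketch of one plausible reading, assuming a boolean goal predicate and a distance-to-goal signal supplied by the task; these interfaces and constants are hypothetical illustrations, not the paper's actual definitions.

        def maintenance_reward(satisfied):
            # Assumed shape: +1 for every step the goal condition holds, 0 otherwise.
            return 1.0 if satisfied else 0.0

        def approach_reward(prev_dist, curr_dist):
            # Assumed shape: reward progress, positive when distance to the goal shrinks.
            return prev_dist - curr_dist

        def avoidance_reward(curr_dist, danger_radius=1.0):
            # Assumed shape: penalize entering the avoided region; 0 while safely away.
            return -1.0 if curr_dist < danger_radius else 0.0

        def achievement_reward(satisfied, already_achieved):
            # Assumed shape: one-off reward the first time the condition becomes true.
            if satisfied and not already_achieved:
                return 1.0, True
            return 0.0, already_achieved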

    Concurrent Skill Composition using Ensemble of Primitive Skills

    One of the key characteristics of an open-ended, cumulative learning agent is that it should use the knowledge gained from prior learning to solve future tasks. That characteristic is especially essential in robotics, as learning every perception-action skill from scratch is not only time-consuming but may not always be feasible. In reinforcement learning, this learned knowledge is called a policy. A lifelong learning agent should treat the policies of learned tasks as building blocks for solving future tasks. Tasks can be categorized by their composition, ranging from primitive tasks to compound tasks that are sequential or concurrent combinations of primitive tasks. The agent therefore needs to be able to combine the policies of primitive tasks to solve compound tasks, which are then added to its knowledge base. Inspired by modular neural networks, we propose an approach to compose policies for compound tasks that are concurrent combinations of disjoint tasks. Furthermore, we hypothesize that learning in a specialized environment leads to more efficient learning; hence, we create scaffolded environments in which the robot learns primitive skills for our mobile-robot-based experiments. We then show how the agent can combine those primitive skills to learn solutions for compound tasks. This reduces the overall training time for multiple skills and creates a versatile agent that can mix and match its skills.
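
    As a rough illustration of the composition idea (not the paper's modular-network model), a concurrent compound task over disjoint primitive tasks can be handled by aggregating each primitive skill's action values and acting greedily on the ensemble. The tabular `q_tables` interface below is an assumption made for the sketch.

        import numpy as np

        def compose_concurrent(q_tables, state):
            # Sum the primitive skills' action values for this state and act
            # greedily on the ensemble estimate for the compound task.
            combined = sum(q[state] for q in q_tables)
            return int(np.argmax(combined))

        # Usage: two hypothetical primitive skills over 5 states and 3 actions.
        rng = np.random.default_rng(0)
        q_tables = [rng.random((5, 3)) for _ in range(2)]
        action = compose_concurrent(q_tables, state=2)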

    A Review of the Relationship between Novelty, Intrinsic Motivation and Reinforcement Learning

    This paper presents a review of the tripartite relationship between novelty, intrinsic motivation, and reinforcement learning. The paper first presents a literature survey on novelty and the different computational models of novelty detection, with a specific focus on the features of stimuli that trigger a hedonic value for generating a novelty signal. It then presents an overview of intrinsic motivation and investigates different models, with the aim of exploring deeper correlations between specific features of a novelty signal and its effect on intrinsic motivation in producing a reward function. Finally, it presents survey results on reinforcement learning, different models, and their functional relationship with intrinsic motivation.
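
    To make the novelty-to-reward link concrete, here is a minimal count-based sketch in the spirit of the models such reviews survey; the square-root decay and the additive combination with the extrinsic reward are illustrative assumptions, not formulations from the paper.

        from collections import defaultdict
        import math

        visit_counts = defaultdict(int)

        def novelty_bonus(state, beta=0.1):
            # Rarely visited states are more novel and earn a larger bonus
            # (beta and the 1/sqrt(count) decay are assumed choices).
            visit_counts[state] += 1
            return beta / math.sqrt(visit_counts[state])

        def shaped_reward(extrinsic, state):
            # The learner optimizes task reward plus the novelty-driven bonus.
            return extrinsic + novelty_bonus(state)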